Ameliorated de novo transcriptome assembly using Illumina paired end sequence data with Trinity Assembler
Abstract
The advent of next-generation sequencing has made de novo transcriptome assembly possible for organisms that lack a complete genome sequence. Among the available sequencing platforms, Illumina is the most widely used, owing to its data quality, quantity and cost. Various de novo transcriptome assemblers are also available today. In this study, we aimed to obtain an ameliorated de novo transcriptome assembly from Illumina sequence reads assembled with the Trinity assembler. We found that the primary transcriptome assembly produced by Trinity can be ameliorated on the basis of transcript length, coverage, depth, and protein homology. Our amelioration approach is reproducible and could enhance the sensitivity and specificity of the assembled transcriptome, which could be critical for validating the assembled transcripts and for planning downstream biological assays.
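The filtering idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the abstract gives no thresholds and does not say how the criteria are combined, so the cut-offs (`min_length`, `min_depth`), the choice to require all criteria at once, and the function names `parse_fasta` and `filter_transcripts` are all assumptions made for illustration. Per-transcript depth values and the set of transcripts with a protein homology hit are assumed to come from separate read-mapping and homology-search steps that are not shown here.

```python
from typing import Dict, Iterable, Iterator, Set, Tuple


def parse_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (transcript_id, sequence) pairs from FASTA-formatted lines."""
    name = None
    chunks = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            # Keep only the first whitespace-delimited token as the ID.
            header = line[1:].split()
            name = header[0] if header else ""
            chunks = []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def filter_transcripts(
    records: Iterable[Tuple[str, str]],
    mean_depth: Dict[str, float],
    homology_hits: Set[str],
    min_length: int = 300,   # hypothetical cut-off, not from the source
    min_depth: float = 5.0,  # hypothetical cut-off, not from the source
) -> Dict[str, str]:
    """Keep transcripts that pass every criterion (an assumed combination rule)."""
    kept = {}
    for tid, seq in records:
        if len(seq) < min_length:
            continue  # too short
        if mean_depth.get(tid, 0.0) < min_depth:
            continue  # too little read support
        if tid not in homology_hits:
            continue  # no protein homology hit
        kept[tid] = seq
    return kept
```

A caller would pass the lines of the primary assembly FASTA file to `parse_fasta` and feed the result to `filter_transcripts`. A breadth-of-coverage criterion could be added in the same way as the depth check.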
Similar resources
Compacting and correcting Trinity and Oases RNA-Seq de novo assemblies
BACKGROUND De novo transcriptome assembly of short reads is now a common step in expression analysis of organisms lacking a reference genome sequence. Several software packages are available to perform this task. Even if their results are of good quality it is still possible to improve them in several ways including redundancy reduction or error correction. Trinity and Oases are two commonly us...
Clustering of Short Read Sequences for de novo Transcriptome Assembly
Given the importance of transcriptome analysis in various biological studies and considering the vast amount of whole transcriptome sequencing data, it seems necessary to develop an algorithm to assemble transcriptome data. In this study we propose an algorithm for transcriptome assembly in the absence of a reference genome. First, the contiguous sequences are generated using de Bruijn graph with d...
Tedna: a transposable element de novo assembler
MOTIVATION Recent technological advances are allowing many laboratories to sequence their research organisms. Available de novo assemblers leave repetitive portions of the genome poorly assembled. Some genomes contain high proportions of transposable elements, and transposable elements appear to be a major force behind diversity and adaptation. Few de novo assemblers for transposable elements e...
Illumina-Based De Novo Transcriptome Analysis and Identifications of Genes Involved in the Monolignol Biosynthesis Pathway in Acacia koa
Corresponding Author: Dulal Borthakur, Department of Molecular Biosciences and Bioengineering, University of Hawaii, Honolulu, HI 96822, USA. Abstract: Acacia koa is a leguminous timber tree endemic to the Hawaiian Islands. For breeding projects aimed at improving the wood quality of A. koa, an understanding of the genes influencing wood quality is crucial. Therefore, the objectiv...
De novo transcriptome assembly with ABySS
MOTIVATION Whole transcriptome shotgun sequencing data from non-normalized samples offer unique opportunities to study the metabolic states of organisms. One can deduce gene expression levels using sequence coverage as a surrogate, identify coding changes or discover novel isoforms or transcripts. Especially for discovery of novel events, de novo assembly of transcriptomes is desirable. RESUL...